SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: Greenspan, Daniel S 
Takahara, Kazuhiko 
Hoffman, Guy G 

(ii) TITLE OF INVENTION: Mammalian Tolloid-Like Protein

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Quarles & Brady 

(B) STREET: 1 South Pinckney Street

(C) CITY: Madison 

(D) STATE: WI 

(E) COUNTRY: US 

(F) ZIP: 53703 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:



(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Berson, Bennett J

(B) REGISTRATION NUMBER: 37,094

(C) REFERENCE/DOCKET NUMBER: 960296.93839

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 608-251-5000

(B) TELEFAX: 608-251-9166 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 330 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide probe"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ACGTCCAGAC CGGAGCGGGT GTGGCCCGAT GGGGTCATCC CGTTTGTGAT TGGAGGGAAT 60

TTCACAGGCA GCCAGAGGGC AGTCTTCCGG CAGGCCATGA GACACTGGGA GAAGCATACC 120

TGTGTCACCT TCTTGGAGCG CACAGATGAG GACAGCTATA TTGTATTCAC CTACCGACCC 180

TGCGGGTGCT GCTCCTACGT GGGTCGCCGA GGTGGGGGCC CCCAGGCCAT CTCCATCGGC 240

AAGAACTGTG ACAAGTTTGG CATCGTGGTC CATGAGCTGG GCCATGTCAT TGGCTTCTGG 300

CACGAGCACA CGCGGCCCGA CCGCGACCGC 330

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4771 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 611..3652

(D) OTHER INFORMATION: /product= "murine mTll protein"
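
As a cross-check on the feature table above, the CDS coordinates can be converted to a nucleotide and codon count. The following is only a minimal sketch: it assumes the LOCATION values are 1-based and inclusive, and the helper name `cds_summary` is hypothetical and not part of the listing.

```python
def cds_summary(start: int, end: int) -> dict:
    """Return nucleotide and codon counts for a CDS.

    Assumes 1-based, inclusive coordinates (an assumption about the
    LOCATION field, not something the listing states explicitly).
    """
    if end < start:
        raise ValueError("end must not precede start")
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return {"nucleotides": nucleotides, "codons": nucleotides // 3}


# Coordinates taken from the FEATURE block of SEQ ID NO: 2.
print(cds_summary(611, 3652))
```

For the range 611..3652 this gives 3042 nucleotides, that is 1014 codons; in the sequence description that follows, the last of those codons is the TAA stop.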

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CACACCCCTT TGCTCTCCGG GCAGTCGGGA GCTTCCCTAG CTTCGGCAGG CTTTTAAGGT 60

CTGGCGGCGT AGAAATGCCT ATCCCCCACC CCCTTCCTCG GTCTCCCCTT TCAGTTCAGA 120

TGTGCTGATG TGCAGACCGG ATTCATCTTC CCCGAGCAGC GGCGGTGGCA GCGGCGGGCG 180

CAGGCGGCTG CAGCTCGCTC TCGGCCGCGG GGTCCTGACA GCGGCGGGGG CGCGGCGCGG 240

GAGCCGGAGC TCCGGTGGCA GCTGAGCCCG CCGTGCGCCT CTCGCCGCGG CCGGTCGTGA 300

TCGCGGGAAG TTCGACCGCT GGAAGGACGA CCTAGACCGA GCCGGGTTGG CTGCGGCTGC 360

CCTGCGCCGA GCTCCTCACC TGCCTTCCGC CCACCCGCGG GCCCCCGGCC AAGTTCCCCA 420

GCATCCGGGG GAGACAGGGA GACATTTGCC CTCTCTAGCG TCCTGAAGAC ATCCGCATGT 480

CTCCGGACAC CTGAACATTC AGGTCTTTCC GAGGAGCTTC GCAGTCGGGA TAAGAACACT 540

GTCCCTAGAG CCCCGCATAT CCACGCGGCC CTCCGGGTCT GGTCCCCTCC TTTTCCTCTA 600

GGGGAGGAGG ATG GGT TTG CAA GCG CTC TCC CCG AGG ATG CTC CTG TGG 649
Met Gly Leu Gln Ala Leu Ser Pro Arg Met Leu Leu Trp
1 5 10

TTG GTG GTC TCG GGT ATT GTT TTC TCC CGG GTG CTG TGG GTC TGC GCT 697
Leu Val Val Ser Gly Ile Val Phe Ser Arg Val Leu Trp Val Cys Ala
15 20 25

GGC CTC GAT TAT GAT TAC ACT TTT GAT GGG AAC GAA GAG GAC AAA ACG 745
Gly Leu Asp Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr
30 35 40 45

GAG CCT ATA GAT TAC AAG GAC CCG TGC AAA GCT GCT GTG TTT TGG GGT 793
Glu Pro Ile Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly
50 55 60

GAC ATC GCC TTA GAT GAT GAA GAC TTA AAT ATC TTC CAA ATA GAC AGG 841
Asp Ile Ala Leu Asp Asp Glu Asp Leu Asn Ile Phe Gln Ile Asp Arg
65 70 75

ACA ATT GAC CTG ACC CAG AGC CCC TTT GGA AAA CTT GGA CAT ATT ACA 889
Thr Ile Asp Leu Thr Gln Ser Pro Phe Gly Lys Leu Gly His Ile Thr
80 85 90

GGT GGC TTT GGA GAC CAT GGC ATG CCA AAG AAG CGA GGG GCA CTC TAC 937
Gly Gly Phe Gly Asp His Gly Met Pro Lys Lys Arg Gly Ala Leu Tyr
95 100 105



CAA CTT ATA GAG AGG ATC AGA AGA ATT GGC TCT GGC TTG GAG CAA AAT 985
Gln Leu Ile Glu Arg Ile Arg Arg Ile Gly Ser Gly Leu Glu Gln Asn
110 115 120 125

AAC ACG ATG AAG GGA AAA GCA CCT CCA AAA TTG TCA GAG CAA AGT GAG 1033
Asn Thr Met Lys Gly Lys Ala Pro Pro Lys Leu Ser Glu Gln Ser Glu
130 135 140

AAA AAT CGA GTT CCC AGA GCT GCT ACC TCA AGA ACG GAA AGG ATA TGG 1081
Lys Asn Arg Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg Ile Trp
145 150 155

CCT GGG GGT GTC ATT CCT TAT GTC ATA GGA GGA AAC TTT ACT GGC AGC 1129
Pro Gly Gly Val Ile Pro Tyr Val Ile Gly Gly Asn Phe Thr Gly Ser
160 165 170

CAG AGA GCC ATG TTC AAG CAG GCC ATG AGA CAC TGG GAA AAG CAC ACC 1177
Gln Arg Ala Met Phe Lys Gln Ala Met Arg His Trp Glu Lys His Thr
175 180 185

TGT GTG ACG TTC ACT GAG AGA AGT GAT GPJK GAA AGT TAT ATT GTG TTC 1225
Cys Val Thr Phe Thr Glu Arg Ser Asp Glu Glu Ser Tyr Ile Val Phe
190 195 200 205

ACC TAC AGG CCT TGT GGA TGC TGC TCC TAT GTT GGT CGG CGG GGA AAT 1273
Thr Tyr Arg Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn
210 215 220

GGC CCT CAG GCC ATC TCT ATT GGC AAG AAC TGT GAC AAG TTT GGA ATT 1321
Gly Pro Gln Ala Ile Ser Ile Gly Lys Asn Cys Asp Lys Phe Gly Ile
225 230 235

GTT GTT CAT GAA CTG GGC CAC GTG ATA GGC TTC TGG CAT GAA CAT ACC 1369
Val Val His Glu Leu Gly His Val Ile Gly Phe Trp His Glu His Thr
240 245 250

CGC CCA GAC CGA GAC AAC CAT GTC ACC ATC ATT AGA GAG AAC ATC CAG 1417
Arg Pro Asp Arg Asp Asn His Val Thr Ile Ile Arg Glu Asn Ile Gln
255 260 265

CCA GGT CAA GAG TAC AAT TTT CTA AAG ATG GAG CCT GGA GAA GTG AAC 1465
Pro Gly Gln Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn
270 275 280 285

TCT CTT GGG GAA AGA TAT GAT TTT GAC AGT ATC ATG CAC TAC GCC AGG 1513
Ser Leu Gly Glu Arg Tyr Asp Phe Asp Ser Ile Met His Tyr Ala Arg
290 295 300

AAC ACC TTC TCA AGA GGG ATG TTT TTA GAC ACA ATA CTC CCC TCC CGT 1561
Asn Thr Phe Ser Arg Gly Met Phe Leu Asp Thr Ile Leu Pro Ser Arg
305 310 315

GAT GAT AAT GGC ATT CGT CCT GCA ATT GGT CAA CGG ACC CGG TTA AGC 1609
Asp Asp Asn Gly Ile Arg Pro Ala Ile Gly Gln Arg Thr Arg Leu Ser
320 325 330

AAA GGA GAC ATT GCA CAA GCA AGA AAG CTG TAT CGA TGC CCA GCA TGT 1657
Lys Gly Asp Ile Ala Gln Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys
335 340 345

GGA GAA ACC CTG CAA GAA TCC AGT GGC AAC CTT TCT TCC CCA GGA TTC 1705
Gly Glu Thr Leu Gln Glu Ser Ser Gly Asn Leu Ser Ser Pro Gly Phe
350 355 360 365

CCA AAT GGC TAC CCT TCC TAC ACA CAC TGC ATC TGG AGA GTG TCT GTG 1753
Pro Asn Gly Tyr Pro Ser Tyr Thr His Cys Ile Trp Arg Val Ser Val
370 375 380

ACC CCG GGA GAA AAG ATT GTC TTG AAT TTT ACC ACA ATG GAC CTT TAC 1801
Thr Pro Gly Glu Lys Ile Val Leu Asn Phe Thr Thr Met Asp Leu Tyr
385 390 395

AAA AGT AGT TTG TGC TGG TAT GAT TAC ATT GAA GTA AGA GAT GGT TAC 1849
Lys Ser Ser Leu Cys Trp Tyr Asp Tyr Ile Glu Val Arg Asp Gly Tyr
400 405 410

TGG AGG AAG TCA CCT CTC CTT GGT AGA TTC TGT GGG GAC AAA GTG GCT 1897
Trp Arg Lys Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Val Ala
415 420 425

GGA GTT CTT ACA TCT ACG GAC AGC AGA ATG TGG ATT GAG TTT CGT AGC 1945
Gly Val Leu Thr Ser Thr Asp Ser Arg Met Trp Ile Glu Phe Arg Ser
430 435 440 445

AGC AGT AAC TGG GTA GGA AAA GGG TTT GCA GCT GTC TAT GAA GCG ATT 1993
Ser Ser Asn Trp Val Gly Lys Gly Phe Ala Ala Val Tyr Glu Ala Ile
450 455 460

TGT GGA GGG GAG ATA AGG AAA AAC GAA GGG CAG ATT CAG TCT CCC AAT 2041
Cys Gly Gly Glu Ile Arg Lys Asn Glu Gly Gln Ile Gln Ser Pro Asn
465 470 475

TAC CCC GAT GAC TAC CGA CCA ATG AAG GAG TGT GTA TGG AAA ATA ATG 2089
Tyr Pro Asp Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys Ile Met
480 485 490

GTG TCC GAG GGC TAC CAT GTT GGA CTG ACC TTT CAG GCC TTT GAG ATC 2137
Val Ser Glu Gly Tyr His Val Gly Leu Thr Phe Gln Ala Phe Glu Ile
495 500 505

GAA AGA CAT GAC AGC TGT GCC TAT GAC CAC CTA GAA GTT CGA GAT GGA 2185
Glu Arg His Asp Ser Cys Ala Tyr Asp His Leu Glu Val Arg Asp Gly
510 515 520 525

GCC AGT GAG AAC AGC CCT TTG ATA GGA CGG TTC TGT GGT TAT GAC AAA 2233
Ala Ser Glu Asn Ser Pro Leu Ile Gly Arg Phe Cys Gly Tyr Asp Lys
530 535 540

CCT GAA GAT ATA AGG TCT ACT TCC AAC ACC CTG TGG ATG AAG TTT GTC 2281
Pro Glu Asp Ile Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val
545 550 555

TCT GAC GGG ACT GTG AAC AAG GCA GGG TTT GCT GCG AAC TTT TTT AAA 2329
Ser Asp Gly Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys
560 565 570

GAG GAA GAT GAG TGT GCC AAA CCT GAC CGA GGA GGC TGT GAA CAG AGG 2377
Glu Glu Asp Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gln Arg
575 580 585

TGT CTT AAC ACA CTA GGC AGC TAC CAG TGT GCC TGT GAG CCT GGC TAT 2425
Cys Leu Asn Thr Leu Gly Ser Tyr Gln Cys Ala Cys Glu Pro Gly Tyr
590 595 600 605

GAA CTG GGG CCA GAC AGA AGA AGC TGT GAA GCT GCT TGC GGA GGA CTT 2473
Glu Leu Gly Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu
610 615 620

CTG ACG AAG CTC AAT GGC ACC ATA ACC ACC CCC GGC TGG CCC AAA GAG 2521
Leu Thr Lys Leu Asn Gly Thr Ile Thr Thr Pro Gly Trp Pro Lys Glu
625 630 635

TAC CCT CCA AAC AAA AAC TGT GTG TGG CAA GTG ATC GCG CCA AGC CAG 2569
Tyr Pro Pro Asn Lys Asn Cys Val Trp Gln Val Ile Ala Pro Ser Gln
640 645 650

TAC AGA ATC TCT GTG AAG TTT GAG TTT TTT GAA TTG GAA GGC AAT GAA 2617
Tyr Arg Ile Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu
655 660 665

GTT TGC AAA TAC GAT TAC GTG GAG ATC TGG AGC GGC CCT TCC TCT GAG 2665
Val Cys Lys Tyr Asp Tyr Val Glu Ile Trp Ser Gly Pro Ser Ser Glu
670 675 680 685

TCT AAA CTG CAT GGC AAG TTC TGT GGC GCT GAC ATA CCT GAA GTG ATG 2713
Ser Lys Leu His Gly Lys Phe Cys Gly Ala Asp Ile Pro Glu Val Met
690 695 700

ACT TCC CAT TTC AAC AAC ATG AGG ATT GAA TTC AAG TCA GAC AAC ACT 2761
Thr Ser His Phe Asn Asn Met Arg Ile Glu Phe Lys Ser Asp Asn Thr
705 710 715

GTA TCC AAG AAG GGC TTC AAA GCA CAT TTT TTC TCA GAT AAG GAT GAG 2809
Val Ser Lys Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu
720 725 730

TGT TCA AAG GAT AAT GGT GGC TGT CAG CAT GAG TGT GTC AAC ACG ATG 2857
Cys Ser Lys Asp Asn Gly Gly Cys Gln His Glu Cys Val Asn Thr Met
735 740 745

GGA AGT TAC ACG TGT CAG TGC CGG AAT GGA TTC GTG TTG CAT GAG AAC 2905
Gly Ser Tyr Thr Cys Gln Cys Arg Asn Gly Phe Val Leu His Glu Asn
750 755 760 765

AAG CAT GAT TGC AAG GAA GCC GAG TGT GAA CAG AAG ATA CAC AGC CCA 2953
Lys His Asp Cys Lys Glu Ala Glu Cys Glu Gln Lys Ile His Ser Pro
770 775 780

AGT GGT CTC ATC ACC AGT CCC AAC TGG CCA GAC AAG TAT CCA AGC AGG 3001
Ser Gly Leu Ile Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg
785 790 795

AAA GAG TGC ACG TGG GTG ATC AGT GCC ATT CCT GGC CAC CGC ATC ACA 3049
Lys Glu Cys Thr Trp Val Ile Ser Ala Ile Pro Gly His Arg Ile Thr
800 805 810

TTA GCC TTC AAT GAG TTT GAG GTT GAA CAA CAT CAA GAA TGT GCT TAT 3097
Leu Ala Phe Asn Glu Phe Glu Val Glu Gln His Gln Glu Cys Ala Tyr
815 820 825

GAT CAC TTG GAA ATT TTT GAT GGA GAA ACG GAG AAG TCA CCA ATA CTT 3145
Asp His Leu Glu Ile Phe Asp Gly Glu Thr Glu Lys Ser Pro Ile Leu
830 835 840 845

GGC CGA CTA TGT GGC AGC AAG ATA CCA GAT CCC CTC ATG GCT ACT GGG 3193
Gly Arg Leu Cys Gly Ser Lys Ile Pro Asp Pro Leu Met Ala Thr Gly
850 855 860

AAT GAA ATG TTT ATT CGG TTT ATT TCT GAT GCC TCT GTT CAA AGA AAA 3241
Asn Glu Met Phe Ile Arg Phe Ile Ser Asp Ala Ser Val Gln Arg Lys
865 870 875

GGC TTT CAA GCT ACA CAT TCC ACA GAG TGT GGT GGT CGA TTG AAA GCA 3289
Gly Phe Gln Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala
880 885 890

GAG TCA AAG CCT AGA GAC CTG TAC TCC CAT GCT CAG TTT GGT GAT AAT 3337
Glu Ser Lys Pro Arg Asp Leu Tyr Ser His Ala Gln Phe Gly Asp Asn
895 900 905

AAC TAC CCA GGA CAA CTG GAC TGT GAA TGG TTG TTG GTG TCA GAA CGA 3385
Asn Tyr Pro Gly Gln Leu Asp Cys Glu Trp Leu Leu Val Ser Glu Arg
910 915 920 925

GGA TCT CGA CTT GAA TTG TCC TTC CAG ACA TTC GAA GTA GAA GAA GAA 3433
Gly Ser Arg Leu Glu Leu Ser Phe Gln Thr Phe Glu Val Glu Glu Glu
930 935 940

GCT GAC TGT GGC TAT GAC TAT GTT GAA GTC TTT GAT GGT CTC AGT TCA 3481
Ala Asp Cys Gly Tyr Asp Tyr Val Glu Val Phe Asp Gly Leu Ser Ser
945 950 955

AAA GCT GTG GGT CTT GGT AGA TTC TGT GGG TCA GGG CCA CCA GAA GAA 3529
Lys Ala Val Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu
960 965 970

ATC TAT TCA ATT GGA GAT GTG GCT TTG ATT CAT TTC CAC ACA GAT GAC 3577
Ile Tyr Ser Ile Gly Asp Val Ala Leu Ile His Phe His Thr Asp Asp
975 980 985

ACT ATC AAC AAG AAA GGA TTT TAT ATA AGA TAT AAA AGT ATA AGA TAC 3625
Thr Ile Asn Lys Lys Gly Phe Tyr Ile Arg Tyr Lys Ser Ile Arg Tyr
990 995 1000 1005

CCG GAA ACG ATG CAT GCC AAG AAC TAA TGCCGACCCT CCCTCAGAAC 3672
Pro Glu Thr Met His Ala Lys Asn *
1010

AGAAAGGAAT GTGCATATGG AAAGAAGACA TTTTTAAAAT AGACAATATT AATACAATTG 3732

TTTTATATAA TGAATTTGAG CAAAAGAAAC CTGCAAGATT AGAGTTATCT CTGAAGTGAA 3792

AGAGTU^CTTT CCAGAAAACC TGATTGGCAT TGCAAGGATG TTTGAAAGTC ATGCTTGTTC 3852

AAGGAAGATT AACAGCTTGA AATAGATGCT TCACATTTTG GACAGTTCAT TTAATGAGCT 3912

GTGATTCTCT GGAGTGATTT CTTGACTACT TTTCCAAGAT CTGGGGACGT AGAAATGATG 3972

GGACGGATCA TAGCTTGGAA ACCCACTTCT TGGGTCTTAG CATGTTTGCT TAGACTCTGT 4032

AGGTCAGACA CAGTGTAAAC CAAATTCATG TAAGGTGATG TGGAATAGTG GTCTTTGGAA 4092

GGTGGTTCAT CATTTAAATG TAGGTTTGTG CTTGTGTGTA TGTTCACATA TGCAAGTGTG 4152

TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGCGTG TGTGTGTGTG TGCGTGTGTG 4212

TGTGTGTGCA TGTGTGTGCA TGTGTGTTTG GAAACTGGAA TATTTCATCT TCATTATTTT 4272

CAAATGCAGG CCAGCTTAAC CTGTTCACAC AAATGATTTT GTGACCACTT CATTGTATCT 4332

GTATCTTGAG AAGTTTGAAA TATCTATAGT GTCTACAATG CAGTTAATCC CTAGATATCG 4392

GATAATACCC AGTTCACTAG TAAACTCATT TCTCTCTGGG GAAGTGCTGA ATAGTCTCCA 4452

AATTCAAGAA ACTCACCATG TCTTATAAAC CTTTAAGATA AAATTCCAAC GAGGTGTGTG 4512

CAGCCATCTT CCTU^TGACT GCCTGGATGT TTCTTAGTCC AGTTACTACT GCTGCTGCTA 4572

TTGGTCTTTC TTTTATTGTT AATGTGTTGA TATGTTGTTA TTATTATGGT TATTATTATT 4632

GATGTTGTTA CTATATTTT^ AAATGATGAG ATGAAGTGGA AGTAGAGTTT GGGAGAAATG 4692

AAATCTCTCT TTTTTGTTCT CTTCTTGAAA TTCAGTTTCA AAAAATACAA TATTGGGTGG 4752

CAAAAAAAAA AAT^^AAAAA 4771

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1014 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Gly Leu Gln Ala Leu Ser Pro Arg Met Leu Leu Trp Leu Val Val
1 5 10 15

Ser Gly Ile Val Phe Ser Arg Val Leu Trp Val Cys Ala Gly Leu Asp
20 25 30

Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Pro Ile
35 40 45

Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp Ile Ala
50 55 60

Leu Asp Asp Glu Asp Leu Asn Ile Phe Gln Ile Asp Arg Thr Ile Asp
65 70 75 80

Leu Thr Gln Ser Pro Phe Gly Lys Leu Gly His Ile Thr Gly Gly Phe
85 90 95

Gly Asp His Gly Met Pro Lys Lys Arg Gly Ala Leu Tyr Gln Leu Ile
100 105 110

Glu Arg Ile Arg Arg Ile Gly Ser Gly Leu Glu Gln Asn Asn Thr Met
115 120 125

Lys Gly Lys Ala Pro Pro Lys Leu Ser Glu Gln Ser Glu Lys Asn Arg
130 135 140

Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg Ile Trp Pro Gly Gly
145 150 155 160

Val Ile Pro Tyr Val Ile Gly Gly Asn Phe Thr Gly Ser Gln Arg Ala
165 170 175

Met Phe Lys Gln Ala Met Arg His Trp Glu Lys His Thr Cys Val Thr
180 185 190

Phe Thr Glu Arg Ser Asp Glu Glu Ser Tyr Ile Val Phe Thr Tyr Arg
195 200 205

Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gln
210 215 220

Ala Ile Ser Ile Gly Lys Asn Cys Asp Lys Phe Gly Ile Val Val His
225 230 235 240

Glu Leu Gly His Val Ile Gly Phe Trp His Glu His Thr Arg Pro Asp
245 250 255

Arg Asp Asn His Val Thr Ile Ile Arg Glu Asn Ile Gln Pro Gly Gln
260 265 270

Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn Ser Leu Gly
275 280 285

Glu Arg Tyr Asp Phe Asp Ser Ile Met His Tyr Ala Arg Asn Thr Phe
290 295 300



Ser Arg Gly Met Phe Leu Asp Thr Ile Leu Pro Ser Arg Asp Asp Asn
305 310 315 320

Gly Ile Arg Pro Ala Ile Gly Gln Arg Thr Arg Leu Ser Lys Gly Asp
325 330 335

Ile Ala Gln Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr
340 345 350

Leu Gln Glu Ser Ser Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly
355 360 365

Tyr Pro Ser Tyr Thr His Cys Ile Trp Arg Val Ser Val Thr Pro Gly
370 375 380

Glu Lys Ile Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser
385 390 395 400

Leu Cys Trp Tyr Asp Tyr Ile Glu Val Arg Asp Gly Tyr Trp Arg Lys
405 410 415

Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Val Ala Gly Val Leu
420 425 430

Thr Ser Thr Asp Ser Arg Met Trp Ile Glu Phe Arg Ser Ser Ser Asn
435 440 445

Trp Val Gly Lys Gly Phe Ala Ala Val Tyr Glu Ala Ile Cys Gly Gly
450 455 460

Glu Ile Arg Lys Asn Glu Gly Gln Ile Gln Ser Pro Asn Tyr Pro Asp
465 470 475 480

Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys Ile Met Val Ser Glu
485 490 495

Gly Tyr His Val Gly Leu Thr Phe Gln Ala Phe Glu Ile Glu Arg His
500 505 510

Asp Ser Cys Ala Tyr Asp His Leu Glu Val Arg Asp Gly Ala Ser Glu
515 520 525

Asn Ser Pro Leu Ile Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp
530 535 540

Ile Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly
545 550 555 560

Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp
565 570 575

Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gln Arg Cys Leu Asn
580 585 590

Thr Leu Gly Ser Tyr Gln Cys Ala Cys Glu Pro Gly Tyr Glu Leu Gly
595 600 605

Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys
610 615 620

Leu Asn Gly Thr Ile Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro
625 630 635 640

Asn Lys Asn Cys Val Trp Gln Val Ile Ala Pro Ser Gln Tyr Arg Ile
645 650 655

Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys
660 665 670

Tyr Asp Tyr Val Glu Ile Trp Ser Gly Pro Ser Ser Glu Ser Lys Leu
675 680 685

His Gly Lys Phe Cys Gly Ala Asp Ile Pro Glu Val Met Thr Ser His
690 695 700

Phe Asn Asn Met Arg Ile Glu Phe Lys Ser Asp Asn Thr Val Ser Lys
705 710 715 720

Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys
725 730 735

Asp Asn Gly Gly Cys Gln His Glu Cys Val Asn Thr Met Gly Ser Tyr
740 745 750

Thr Cys Gln Cys Arg Asn Gly Phe Val Leu His Glu Asn Lys His Asp
755 760 765

Cys Lys Glu Ala Glu Cys Glu Gln Lys Ile His Ser Pro Ser Gly Leu
770 775 780

Ile Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys
785 790 795 800

Thr Trp Val Ile Ser Ala Ile Pro Gly His Arg Ile Thr Leu Ala Phe
805 810 815

Asn Glu Phe Glu Val Glu Gln His Gln Glu Cys Ala Tyr Asp His Leu
820 825 830

Glu Ile Phe Asp Gly Glu Thr Glu Lys Ser Pro Ile Leu Gly Arg Leu
835 840 845

Cys Gly Ser Lys Ile Pro Asp Pro Leu Met Ala Thr Gly Asn Glu Met
850 855 860

Phe Ile Arg Phe Ile Ser Asp Ala Ser Val Gln Arg Lys Gly Phe Gln
865 870 875 880

Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys
885 890 895

Pro Arg Asp Leu Tyr Ser His Ala Gln Phe Gly Asp Asn Asn Tyr Pro
900 905 910

Gly Gln Leu Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arg
915 920 925

Leu Glu Leu Ser Phe Gln Thr Phe Glu Val Glu Glu Glu Ala Asp Cys
930 935 940

Gly Tyr Asp Tyr Val Glu Val Phe Asp Gly Leu Ser Ser Lys Ala Val
945 950 955 960

Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu Ile Tyr Ser
965 970 975

Ile Gly Asp Val Ala Leu Ile His Phe His Thr Asp Asp Thr Ile Asn
980 985 990

Lys Lys Gly Phe Tyr Ile Arg Tyr Lys Ser Ile Arg Tyr Pro Glu Thr
995 1000 1005

Met His Ala Lys Asn *
1010
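
The three-letter protein sequence above corresponds to the coding region annotated in SEQ ID NO: 2, and the correspondence can be spot-checked by translating codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate_three_letter` is hypothetical and not part of the listing. The input is the first 13 codons of the CDS as printed in SEQ ID NO: 2.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order, the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter residue names; "*" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate_three_letter(dna: str) -> str:
    """Translate an in-frame DNA string into three-letter residue names."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons)


# First 13 codons of the CDS in SEQ ID NO: 2 (positions 611 to 649).
first_row = "ATG GGT TTG CAA GCG CTC TCC CCG AGG ATG CTC CTG TGG"
print(translate_three_letter(first_row))
# prints: Met Gly Leu Gln Ala Leu Ser Pro Arg Met Leu Leu Trp
```

The same function applied to the final three codons of the CDS, `AAG AAC TAA`, returns `Lys Asn *`, matching the end of the protein sequence.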

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3919 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 648..3689

(D) OTHER INFORMATION: /product= "human mTll protein"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

rj CTCACACTTT TGCTCTCTTG CAGTCAGTTG CTTTGCTGGC TTCTGCAGGC TTTTAAGGTC 6 0 



TCGCGGCGTA GAAATGCCTG GCCCCCACCC CCTTCCTCGG TCTCCCCTTT CAATTCAGAT 120 
GTGCTGATGT GCAGACCGGA TTCATCTTCT CGGAGCTGCG GCGGCGGCTT TGGGCTCAGG 180 



go CGGCGGCGGC TCGCGCTCGG CCGCGGAGTC CTGGCAGCAG CGGGGACGCG GCGCGGGAGT 24 0 

CCGAGCTCTG GTGGCAGCTG AGCCCGCGGG GCGCCGCTCG CCGAGCCGCG GCCGCGGGAA 300 

ru 

isfl GTTCGGCAGC CAGAAGGACG ACCTGGCAGG CTGCGAGCGC CAGCGCCGCC AGAGCCGAGT 360 

o 

TTGCCTGCGC CCTCCCCGCC TCCGAGTGCA GAGTTCCTTA CCTGCCCTCC GCCCACCCGT 420 

GGGCCCCTAG CCAACTTCTC CCTGCGACTG GGGGTAACAG GCAGTGCTTG CCCTCTCTAC 480 

25 TGTCCCGGCG GCATCCACAT GTTTCCGGAC ACCTGAGCAC CCCGGTCCCG CCGAGGAGCC 54 0 

TCCGGGTGGG GAGAAGAGCA CCGGTGCCCC TAGCCCCGCA CATCAGCGCG GACCGCGGCT 600 

GCCTAACCTC TGGGTCCCGT CCCCTCCTTT TCCTCCGGGG GAGGAGG ATG GGG TTG 656 

Met Gly Leu 
1015 

30 GGA ACG CTT TCC CCG AGG ATG CTC GTG TGG CTG GTG GCC TCG GGG ATT 704 

Gly Thr Leu Ser Pro Arg Met Leu Val Trp Leu Val Ala Ser Gly lie 
1020 1025 1030 

GTT TTC TAG GGG GAG CTA TGG GTC TGC GCT GGC 'CTC GAT TAT GAT TAC 752 
Val Phe Tyr Gly Glu Leu Trp Val Cys Ala Gly Leu Asp Tyr Asp Tyr 
35 1035 1040 1045 

ACT TTT GAT GGG AAC GAA GAG GAT AAA ACA GAG ACT ATA GAT TAC AAG 800 
Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Thr lie Asp Tyr Lys 
1050 1055 1060 1065 

GAC CCG TGT AAA GCC GCT GTA TTT TGG GGC GAT ATT GCC TTA GAT GAT 848 
4 0 Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp He Ala Leu Asp Asp 

1070 1075 1080 
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GAA GAC TTA AAT ATC TTT CAA ATA GAT AGG AC A ATT GAC CTT ACG CAG 896 
Glu Asp Leu Asn lie Phe Gin lie Asp Arg Thr He Asp Leu Thr Gin 
1085 1090 1095 

AAC CCC TTT GGA AAC CTT GGA CAT ACC ACA GGT GGA CTT GGA GAC CAT 944
Asn Pro Phe Gly Asn Leu Gly His Thr Thr Gly Gly Leu Gly Asp His
1100 1105 1110

GCT ATG TCA AAG AAG CGA GGG GCC CTC TAC CAA CTT ATA GAC AGG ATA 992
Ala Met Ser Lys Lys Arg Gly Ala Leu Tyr Gln Leu Ile Asp Arg Ile
1115 1120 1125

AGA AGA ATT GGC TTT GGC TTG GAG CAA AAC AAC ACA GTT AAG GGA AAA 1040
Arg Arg Ile Gly Phe Gly Leu Glu Gln Asn Asn Thr Val Lys Gly Lys
1130 1135 1140 1145

GTA CCT CTA CAA TTC TCA GGG CAA AAT GAG AAA AAT CGA GTT CCC AGA 1088
Val Pro Leu Gln Phe Ser Gly Gln Asn Glu Lys Asn Arg Val Pro Arg
1150 1155 1160

GCC GCT ACA TCA AGA ACG GAA AGA ATA TGG CCT GGA GGC GTT ATT CCT 1136
Ala Ala Thr Ser Arg Thr Glu Arg Ile Trp Pro Gly Gly Val Ile Pro
1165 1170 1175

TAT GTT ATA GGA GGA AAC TTC ACT GGC AGC CAG AGA GCC ATG TTC AAG 1184
Tyr Val Ile Gly Gly Asn Phe Thr Gly Ser Gln Arg Ala Met Phe Lys
1180 1185 1190

CAG GCC ATG AGG CAC TGG GAA AAG CAC ACA TGT GTG ACT TTC ATA GAA 1232
Gln Ala Met Arg His Trp Glu Lys His Thr Cys Val Thr Phe Ile Glu
1195 1200 1205

AGA AGT GAT GAA GAG AGT TAC ATT GTA TTC ACC TAT AGG CCT TGT GGA 1280
Arg Ser Asp Glu Glu Ser Tyr Ile Val Phe Thr Tyr Arg Pro Cys Gly
1210 1215 1220 1225

TGC TGC TCC TAT GTA GGT CGG CGA GGA AAT GGA CCT CAG GCA ATC TCT 1328
Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gln Ala Ile Ser
1230 1235 1240

ATC GGC AAG AAC TGT GAT AAA TTT GGG ATT GTT GTT CAT GAA TTG GGT 1376
Ile Gly Lys Asn Cys Asp Lys Phe Gly Ile Val Val His Glu Leu Gly
1245 1250 1255

CAT GTG ATA GGC TTT TGG CAT GAA CAC ACA AGA CCA GAT CGA GAT AAC 1424
His Val Ile Gly Phe Trp His Glu His Thr Arg Pro Asp Arg Asp Asn
1260 1265 1270

CAC GTA ACT ATC ATA AGA GAA AAC ATC CAG CCA GGT CAA GAG TAC AAT 1472
His Val Thr Ile Ile Arg Glu Asn Ile Gln Pro Gly Gln Glu Tyr Asn
1275 1280 1285

TTT CTG AAG ATG GAG CCT GGA GAA GTA AAC TCA CTT GGA GAA AGA TAT 1520
Phe Leu Lys Met Glu Pro Gly Glu Val Asn Ser Leu Gly Glu Arg Tyr
1290 1295 1300 1305

GAT TTC GAC AGT ATC ATG CAC TAT GCC AGG AAC ACC TTC TCA AGG GGG 1568
Asp Phe Asp Ser Ile Met His Tyr Ala Arg Asn Thr Phe Ser Arg Gly
1310 1315 1320

ATG TTT CTG GAT ACC ATT CTC CCC TCC CGT GAT GAT AAT GGC ATA CGT 1616
Met Phe Leu Asp Thr Ile Leu Pro Ser Arg Asp Asp Asn Gly Ile Arg
1325 1330 1335

CCT GCA ATT GGT CAG CGA ACC CGT CTA AGC AAA GGA GAT ATC GCA CAG 1664
Pro Ala Ile Gly Gln Arg Thr Arg Leu Ser Lys Gly Asp Ile Ala Gln
1340 1345 1350

GCA AGA AAG CTG TAT AGA TGT CCA GCA TGT GGA GAA ACT CTA CAA GAA 1712
Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr Leu Gln Glu
1355 1360 1365

TCC AAT GGC AAC CTT TCC TCT CCA GGA TTT CCC AAT GGC TAC CCT TCT 1760
Ser Asn Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly Tyr Pro Ser
1370 1375 1380 1385

TAC ACA CAC TGC ATC TGG AGA GTT TCT GTG ACC CCA GGG GAG AAG ATT 1808
Tyr Thr His Cys Ile Trp Arg Val Ser Val Thr Pro Gly Glu Lys Ile
1390 1395 1400

GTT TTA AAT TTT ACA ACG ATG GAT CTA TAC AAG AGT AGT TTG TGC TGG 1856
Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser Leu Cys Trp
1405 1410 1415

TAT GAC TAT ATT GAA GTA AGA GAC GGG TAC TGG AGA AAA TCA CCT CTC 1904
Tyr Asp Tyr Ile Glu Val Arg Asp Gly Tyr Trp Arg Lys Ser Pro Leu
1420 1425 1430

CTT GGT AGA TTC TGT GGG GAC AAA TTG CCT GAA GTT CTT ACT TCT ACA 1952
Leu Gly Arg Phe Cys Gly Asp Lys Leu Pro Glu Val Leu Thr Ser Thr
1435 1440 1445

GAC AGC AGA ATG TGG ATT GAG TTT CGT AGC AGC AGT AAT TGG GTA GGA 2000
Asp Ser Arg Met Trp Ile Glu Phe Arg Ser Ser Ser Asn Trp Val Gly
1450 1455 1460 1465

AAA GGC TTT GCA GCT GTC TAT GAA GCG ATC TGT GGA GGT GAG ATA CGT 2048
Lys Gly Phe Ala Ala Val Tyr Glu Ala Ile Cys Gly Gly Glu Ile Arg
1470 1475 1480

AAA AAT GAA GGA CAG ATT CAG TCT CCC AAT TAT CCT GAT GAC TAT CGC 2096
Lys Asn Glu Gly Gln Ile Gln Ser Pro Asn Tyr Pro Asp Asp Tyr Arg
1485 1490 1495

CCG ATG AAA GAA TGT GTG TGG AAA ATA ACA GTG TCT GAG AGC TAC CAC 2144
Pro Met Lys Glu Cys Val Trp Lys Ile Thr Val Ser Glu Ser Tyr His
1500 1505 1510

GTC GGG CTG ACC TTT CAG TCC TTT GAG ATT GAA AGA CAT GAC AAT TGT 2192
Val Gly Leu Thr Phe Gln Ser Phe Glu Ile Glu Arg His Asp Asn Cys
1515 1520 1525

GCT TAT GAC TAC CTG GAA GTT AGA GAT GGA ACC AGT GAA AAT AGC CCT 2240
Ala Tyr Asp Tyr Leu Glu Val Arg Asp Gly Thr Ser Glu Asn Ser Pro
1530 1535 1540 1545

TTG ATA GGG CGT TTC TGT GGT TAT GAC AAA CCT GAA GAC ATA AGA TCT 2288
Leu Ile Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp Ile Arg Ser
1550 1555 1560

ACC TCC AAT ACT TTG TGG ATG AAG TTT GTT TCT GAC GGA ACT GTG AAC 2336
Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly Thr Val Asn
1565 1570 1575

AAA GCA GGG TTT GCT GCT AAC TTT TTT AAA GAG GAA GAT GAG TGT GCC 2384
Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp Glu Cys Ala
1580 1585 1590

AAA CCT GAC CGT GGA GGC TGT GAG CAG CGA TGT CTG AAC ACT CTG GGC 2432
Lys Pro Asp Arg Gly Gly Cys Glu Gln Arg Cys Leu Asn Thr Leu Gly
1595 1600 1605

AGT TAC CAG TGT GCC TGT GAG CCT GGC TAT GAG CTG GGC CCA GAC AGA 2480
Ser Tyr Gln Cys Ala Cys Glu Pro Gly Tyr Glu Leu Gly Pro Asp Arg
1610 1615 1620 1625

AGG AGC TGT GAA GCT GCT TGT GGT GGA CTT CTT ACC AAA CTT AAC GGC 2528
Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys Leu Asn Gly
1630 1635 1640

ACC ATA ACC ACC CCT GGC TGG CCC AAG GAG TAC CCT CCT AAT AAG AAC 2576
Thr Ile Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro Asn Lys Asn
1645 1650 1655

TGT GTG TGG CAA GTG GTT GCA CCA ACC CAG TAC AGA ATT TCT GTG AAG 2624
Cys Val Trp Gln Val Val Ala Pro Thr Gln Tyr Arg Ile Ser Val Lys
1660 1665 1670

TTT GAG TTT TTT GAA TTG GAA GGC AAT GAA GTT TGC AAA TAT GAT TAT 2672
Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys Tyr Asp Tyr
1675 1680 1685

GTG GAG ATC TGG AGT GGT CTT TCC TCT GAG TCT AAA CTG CAT GGC AAA 2720
Val Glu Ile Trp Ser Gly Leu Ser Ser Glu Ser Lys Leu His Gly Lys
1690 1695 1700 1705

TTC TGT GGC GCT GAA GTG CCT GAA GTG ATC ACA TCC CAG TTC AAC AAT 2768
Phe Cys Gly Ala Glu Val Pro Glu Val Ile Thr Ser Gln Phe Asn Asn
1710 1715 1720

ATG AGA ATT GAA TTC AAA TCT GAC AAT ACT GTA TCC AAG AAG GGC TTC 2816
Met Arg Ile Glu Phe Lys Ser Asp Asn Thr Val Ser Lys Lys Gly Phe
1725 1730 1735

AAA GCA CAT TTT TTC TCA GAC AAA GAT GAA TGC TCT AAG GAT AAT GGT 2864
Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys Asp Asn Gly
1740 1745 1750

GGA TGT CAG CAC GAA TGT GTC AAC ACG ATG GGG AGC TAC ATG TGT CAA 2912
Gly Cys Gln His Glu Cys Val Asn Thr Met Gly Ser Tyr Met Cys Gln
1755 1760 1765

TGC CGT AAT GGA TTT GTG CTA CAT GAC AAT AAA CAT GAT TGC AAG GAA 2960
Cys Arg Asn Gly Phe Val Leu His Asp Asn Lys His Asp Cys Lys Glu
1770 1775 1780 1785

GCT GAG TGT GAA CAG AAG ATC CAC AGT CCA AGT GGC CTC ATC ACC AGT 3008
Ala Glu Cys Glu Gln Lys Ile His Ser Pro Ser Gly Leu Ile Thr Ser
1790 1795 1800

CCC AAC TGG CCA GAC AAG TAC CCA AGC AGG AAA GAA TGC ACT TGG GAA 3056
Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys Thr Trp Glu
1805 1810 1815

ATC AGC GCC ACT CCC GGC CAC CGA ATC AAA TTA GCC TTT AGT GAA TTT 3104
Ile Ser Ala Thr Pro Gly His Arg Ile Lys Leu Ala Phe Ser Glu Phe
1820 1825 1830

GAG ATT GAG CAG CAT CAA GAA TGT GCT TAT GAC CAC TTA GAA GTA TTT 3152
Glu Ile Glu Gln His Gln Glu Cys Ala Tyr Asp His Leu Glu Val Phe
1835 1840 1845

GAT GGA GAA ACA GAA AAG TCA CCG ATT CTT GGA CGA CTA TGT GGC AAC 3200
Asp Gly Glu Thr Glu Lys Ser Pro Ile Leu Gly Arg Leu Cys Gly Asn
1850 1855 1860 1865

AAG ATA CCA GAT CCC CTT GTG GCT ACT GGA AAT AAA ATG TTT GTT CGG 3248
Lys Ile Pro Asp Pro Leu Val Ala Thr Gly Asn Lys Met Phe Val Arg
1870 1875 1880

TTT GTT TCT GAT GCA TCT GTT CAA AGA AAA GGC TTT CAA GCC ACA CAT 3296
Phe Val Ser Asp Ala Ser Val Gln Arg Lys Gly Phe Gln Ala Thr His
1885 1890 1895

TCT ACA GAG TGT GGC GGA CGA TTG AAA GCA GAA TCA AAA CCA AGA GAT 3344
Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys Pro Arg Asp
1900 1905 1910

CTG TAC TCA CAT GCT CAG TTT GGT GAT AAC AAC TAC CCA GGA CAG GTT 3392
Leu Tyr Ser His Ala Gln Phe Gly Asp Asn Asn Tyr Pro Gly Gln Val
1915 1920 1925

GAC TGT GAA TGG CTA TTA GTA TCA GAA CGG GGC TCT CGA CTT GAA TTA 3440
Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arg Leu Glu Leu
1930 1935 1940 1945

TCC TTC CAG ACA TTT GAA GTG GAG GAA GAA GCA GAC TGT GGC TAT GAC 3488
Ser Phe Gln Thr Phe Glu Val Glu Glu Glu Ala Asp Cys Gly Tyr Asp
1950 1955 1960

TAT GTG GAG CTC TTT GAT GGT CTT GAT TCA ACA GCT GTG GGG CTT GGT 3536
Tyr Val Glu Leu Phe Asp Gly Leu Asp Ser Thr Ala Val Gly Leu Gly
1965 1970 1975

CGA TTC TGT GGA TCC GGG CCA CCA GAA GAG ATT TAT TCA ATT GGA GAT 3584
Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu Ile Tyr Ser Ile Gly Asp
1980 1985 1990

TCA GTT TTA ATT CAT TTC CAC ACT GAT GAC ACA ATC AAC AAG AAG GGA 3632
Ser Val Leu Ile His Phe His Thr Asp Asp Thr Ile Asn Lys Lys Gly
1995 2000 2005

TTT CAT ATA AGA TAC AAA AGC ATA AGA TAT CCA GAT ACC ACA CAT ACC 3680
Phe His Ile Arg Tyr Lys Ser Ile Arg Tyr Pro Asp Thr Thr His Thr
2010 2015 2020 2025

AAA AAA TAA CACCAAAACC TCTGTCAGAA CACAAAGGAA TGTGCATAAT 3729
Lys Lys *

GGAGAGAAGA CATATTTTTT TTAAAACTGA AGATATTGGC ACAAATGTTT TATACAAAGA 3789

GTTTGAACAA AAAATCCCTG TAAGACCAGA ATTATCTTTG TACTAAAAGA GAAGTTTCCA 3849

GCAAAACCCT CATCAGCATT ACAAGGATAT TTGAACTCCA TGCTTGATGG TATTAATAAA 3909

GCTGGTGAAA 3919

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Gly Leu Gly Thr Leu Ser Pro Arg Met Leu Val Trp Leu Val Ala
1 5 10 15

Ser Gly Ile Val Phe Tyr Gly Glu Leu Trp Val Cys Ala Gly Leu Asp
20 25 30

Tyr Asp Tyr Thr Phe Asp Gly Asn Glu Glu Asp Lys Thr Glu Thr Ile
35 40 45

Asp Tyr Lys Asp Pro Cys Lys Ala Ala Val Phe Trp Gly Asp Ile Ala
50 55 60

Leu Asp Asp Glu Asp Leu Asn Ile Phe Gln Ile Asp Arg Thr Ile Asp
65 70 75 80

Leu Thr Gin Asn Pro Phe Gly Asn Leu Gly His Thr Thr Gly Gly Leu 
85 90 95 

Gly Asp His Ala Met Ser Lys Lys Arg Gly Ala Leu Tyr Gin Leu lie 
100 105 110 

Asp Arg lie Arg Arg lie Gly Phe Gly Leu Glu Gin Asn Asn Thr Val 
115 120 125 

Lys Gly Lys Val Pro Leu Gin Phe Ser Gly Gin Asn Glu Lys Asn Arg 
130 135 140 

Val Pro Arg Ala Ala Thr Ser Arg Thr Glu Arg lie Trp Pro Gly Gly 
145 150 155 160 

Val lie Pro Tyr Val lie Gly Gly Asn Phe Thr Gly Ser Gin Arg Ala 
165 170 175 

Met Phe Lys Gin Ala Met Arg His Trp Glu Lys His Thr Cys Val Thr 
180 185 190 

Phe lie Glu Arg Ser Asp Glu Glu Ser Tyr lie Val Phe Thr Tyr Arg 
195 200 205 

Pro Cys Gly Cys Cys Ser Tyr Val Gly Arg Arg Gly Asn Gly Pro Gin 
210 215 220 

Ala lie Ser lie Gly Lys Asn Cys Asp Lys Phe Gly He Val Val His 
225 230 235 .240 

Glu Leu Gly His Val He Gly Phe Trp His Glu His Thr Arg Pro Asp 
245 250 255 

Arg Asp Asn His Val Thr He He Arg Glu Asn He Gin Pro Gly Gin 
260 265 270 

Glu Tyr Asn Phe Leu Lys Met Glu Pro Gly Glu Val Asn Ser Leu Gly 
275 280 285 

Glu Arg Tyr Asp Phe Asp Ser He Met His Tyr Ala Arg Asn Thr Phe 
290 295 300 

Ser Arg Gly Met Phe Leu Asp Thr He Leu Pro Ser Arg Asp Asp Asn 
305 310 315 320 

Gly He Arg Pro Ala He Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp 
325 330 335 

He Ala Gin Ala Arg Lys Leu Tyr Arg Cys Pro Ala Cys Gly Glu Thr 
340 345 350 

Leu Gin Glu Ser Asn Gly Asn Leu Ser Ser Pro Gly Phe Pro Asn Gly 
355 360 365 

Tyr Pro Ser Tyr Thr His Cys He Trp Arg Val Ser Val Thr Pro Gly 
370 375 380 

Glu Lys He Val Leu Asn Phe Thr Thr Met Asp Leu Tyr Lys Ser Ser 
385 390 395 400 

Leu Cys Trp Tyr Asp Tyr He Glu Val Arg Asp Gly Tyr Trp Arg Lys 
405 410 415 
Ser Pro Leu Leu Gly Arg Phe Cys Gly Asp Lys Leu Pro Glu Val Leu
420 425 430

Thr Ser Thr Asp Ser Arg Met Trp Ile Glu Phe Arg Ser Ser Ser Asn
435 440 445

Trp Val Gly Lys Gly Phe Ala Ala Val Tyr Glu Ala Ile Cys Gly Gly
450 455 460

Glu Ile Arg Lys Asn Glu Gly Gln Ile Gln Ser Pro Asn Tyr Pro Asp
465 470 475 480

Asp Tyr Arg Pro Met Lys Glu Cys Val Trp Lys Ile Thr Val Ser Glu
485 490 495

Ser Tyr His Val Gly Leu Thr Phe Gln Ser Phe Glu Ile Glu Arg His
500 505 510

Asp Asn Cys Ala Tyr Asp Tyr Leu Glu Val Arg Asp Gly Thr Ser Glu
515 520 525

Asn Ser Pro Leu Ile Gly Arg Phe Cys Gly Tyr Asp Lys Pro Glu Asp
530 535 540

Ile Arg Ser Thr Ser Asn Thr Leu Trp Met Lys Phe Val Ser Asp Gly
545 550 555 560

Thr Val Asn Lys Ala Gly Phe Ala Ala Asn Phe Phe Lys Glu Glu Asp
565 570 575

Glu Cys Ala Lys Pro Asp Arg Gly Gly Cys Glu Gln Arg Cys Leu Asn
580 585 590

Thr Leu Gly Ser Tyr Gln Cys Ala Cys Glu Pro Gly Tyr Glu Leu Gly
595 600 605

Pro Asp Arg Arg Ser Cys Glu Ala Ala Cys Gly Gly Leu Leu Thr Lys
610 615 620

Leu Asn Gly Thr Ile Thr Thr Pro Gly Trp Pro Lys Glu Tyr Pro Pro
625 630 635 640

Asn Lys Asn Cys Val Trp Gln Val Val Ala Pro Thr Gln Tyr Arg Ile
645 650 655

Ser Val Lys Phe Glu Phe Phe Glu Leu Glu Gly Asn Glu Val Cys Lys
660 665 670

Tyr Asp Tyr Val Glu Ile Trp Ser Gly Leu Ser Ser Glu Ser Lys Leu
675 680 685

His Gly Lys Phe Cys Gly Ala Glu Val Pro Glu Val Ile Thr Ser Gln
690 695 700

Phe Asn Asn Met Arg Ile Glu Phe Lys Ser Asp Asn Thr Val Ser Lys
705 710 715 720

Lys Gly Phe Lys Ala His Phe Phe Ser Asp Lys Asp Glu Cys Ser Lys
725 730 735

Asp Asn Gly Gly Cys Gln His Glu Cys Val Asn Thr Met Gly Ser Tyr
740 745 750

Met Cys Gln Cys Arg Asn Gly Phe Val Leu His Asp Asn Lys His Asp
755 760 765

Cys Lys Glu Ala Glu Cys Glu Gln Lys Ile His Ser Pro Ser Gly Leu
770 775 780

Ile Thr Ser Pro Asn Trp Pro Asp Lys Tyr Pro Ser Arg Lys Glu Cys
785 790 795 800

Thr Trp Glu Ile Ser Ala Thr Pro Gly His Arg Ile Lys Leu Ala Phe
805 810 815

Ser Glu Phe Glu Ile Glu Gln His Gln Glu Cys Ala Tyr Asp His Leu
820 825 830

Glu Val Phe Asp Gly Glu Thr Glu Lys Ser Pro Ile Leu Gly Arg Leu
835 840 845

Cys Gly Asn Lys Ile Pro Asp Pro Leu Val Ala Thr Gly Asn Lys Met
850 855 860

Phe Val Arg Phe Val Ser Asp Ala Ser Val Gln Arg Lys Gly Phe Gln
865 870 875 880

Ala Thr His Ser Thr Glu Cys Gly Gly Arg Leu Lys Ala Glu Ser Lys
885 890 895

Pro Arg Asp Leu Tyr Ser His Ala Gln Phe Gly Asp Asn Asn Tyr Pro
900 905 910

Gly Gln Val Asp Cys Glu Trp Leu Leu Val Ser Glu Arg Gly Ser Arg
915 920 925

Leu Glu Leu Ser Phe Gln Thr Phe Glu Val Glu Glu Glu Ala Asp Cys
930 935 940

Gly Tyr Asp Tyr Val Glu Leu Phe Asp Gly Leu Asp Ser Thr Ala Val
945 950 955 960

Gly Leu Gly Arg Phe Cys Gly Ser Gly Pro Pro Glu Glu Ile Tyr Ser
965 970 975

Ile Gly Asp Ser Val Leu Ile His Phe His Thr Asp Asp Thr Ile Asn
980 985 990

Lys Lys Gly Phe His Ile Arg Tyr Lys Ser Ile Arg Tyr Pro Asp Thr
995 1000 1005

Thr His Thr Lys Lys *
1010

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CCAGCTTAAC CTGTTCACAC 20

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

AACTCTACTT CCACTTCATC 20

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TCAGAACAGA AAGGAATGTG 20



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GACCACTATT CCACATCACC 20

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TCTTGCAGTC AGTTGCTTTG CTGG 24

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TAGTGCGGCC GCACATTCCT TTGTGTTC 28



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Tyr Ile Arg Tyr Lys Ser Ile Arg Tyr Pro Glu Thr Met His Ala
1 5 10 15

Lys Asn 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys His Ile Arg Tyr Lys Ser Ile Arg Tyr Pro Asp Thr Thr His Thr
1 5 10 15

Lys Lys 



QBMAD1\1 34023 



